Speaker Matching: A Third Professional Project

Author

  • Andrew J. Mason
Abstract

This report examines a problem faced by Axent Audio Products Limited, a New Zealand speaker manufacturer, in forming stereo speaker pairs from a batch of similar speakers in such a way that the sound quality of the pairs is maximised. The applications of the maximum cardinality algorithm and sum matching algorithm to this problem are discussed in detail. A new heuristic is developed for matching with a lexicographic objective function, and results of applying the heuristic are given.
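To make the idea of a lexicographic matching objective concrete, here is a minimal Python sketch. It is not the heuristic developed in the report, which the abstract does not describe. It assumes, purely for illustration, that each speaker is summarised by a single hypothetical `response` number and that a pair is "acceptable" when the two numbers differ by at most a hypothetical `tolerance`. The sketch enumerates every pairing of a small batch and picks the one that first maximises the number of acceptable pairs (a cardinality-style criterion) and then minimises the total mismatch (a sum-style criterion). Exhaustive search is only practical for very small batches.

```python
from typing import Iterator, List, Sequence, Tuple

Pair = Tuple[int, int]


def all_pairings(items: List[int]) -> Iterator[List[Pair]]:
    """Yield every way to split an even-sized list into unordered pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in all_pairings(remaining):
            yield [(first, partner)] + tail


def best_pairing(response: Sequence[float], tolerance: float) -> List[Pair]:
    """Pick the pairing that is best under a two-level lexicographic objective.

    Assumed objective (illustrative only):
      1. maximise the number of pairs whose mismatch is within `tolerance`;
      2. among those, minimise the total mismatch.
    """
    if len(response) % 2 != 0:
        raise ValueError("this sketch assumes an even number of speakers")

    def objective(pairing: List[Pair]) -> Tuple[int, float]:
        gaps = [abs(response[a] - response[b]) for a, b in pairing]
        within = sum(1 for g in gaps if g <= tolerance)
        # Tuples compare lexicographically: first element, then second.
        return (-within, sum(gaps))

    return min(all_pairings(list(range(len(response)))), key=objective)


# Hypothetical measurements for a batch of four speakers.
batch = [1.0, 1.1, 2.0, 2.1]
pairs = best_pairing(batch, tolerance=0.25)
print(pairs)  # [(0, 1), (2, 3)]
```

Returning a tuple from `objective` lets Python's built-in tuple ordering do the lexicographic comparison, so further tie-breaking criteria could be added as extra tuple elements.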

Similar resources

Advances on HMM-based text-dependent speaker verification

This paper presents recent developments in text-dependent speaker verification technology in the EU project PICASSO, which have significantly improved SV performance. In the project we adopt an HMM approach for pattern matching. In the paper we describe four different techniques: adaptive variance flooring, multiple use of enrolment sample, generalised competitive measurement for score normalisati...

Text-Independent Speaker Identification

Speaker identification is a difficult task that can be approached in several different ways. State-of-the-art speaker identification techniques include dynamic time warping (DTW) template matching, hidden Markov modeling (HMM), and codebook schemes based on vector quantization (VQ) [2]. In this project, the vector quantization approach will be used, due to ease of implementation and high accur...

Text-Independent Speech Recognition

Introduction: Speaker identification is an area with many different applications. The most practical uses can be found in areas such as security, surveillance, and automatic transcription in a multi-speaker environment. The goal of this project is to understand the development and implementation of a text-independent speaker recognition system. We will analyze the system by interpreting feature m...

SpeechDat multilingual speech databases for teleservices: across the finish line

The goal of the SpeechDat project is to develop spoken language resources for speech recognisers suited to realising voice-driven teleservices. SpeechDat created speech databases for all official languages of the European Union and some major dialectal varieties and minority languages. The size of the databases ranges between 500 and 5000 speakers. In total, 20 databases are recorded over the fixe...

Frame-level Nonlinearity for Robust DTW-based Speaker Verification

Dynamic time warping (DTW) is a successful algorithm in many matching and searching tasks. For text-dependent speaker verification, it is still an appropriate choice when enrollment data are very limited. Yet DTW is very sensitive to endpoint variations between the reference template and test examples. Most reported research on this issue follows two directions: robust endpoint det...

Publication date: 1998